Appl. No. : 10/067,550

Filed : February 5, 2002 



AMENDMENTS TO THE CLAIMS 

1. (Original) A method of extracting audio for indexing of video, comprising:
receiving video information having embedded audio information and associated time codes;

capturing the embedded audio information in the video information; 

extracting a plurality of audio metadata tracks from the audio information, each audio 
metadata track having selected ones of the time codes indicative at least of start and stop times 
for the audio metadata track; 

encoding the video information; and 

accessing the encoded video information with the selected time codes of one of the audio 
metadata tracks. 

2. (Original) The method of Claim 1, wherein the video information is received 
from an analog source. 

3. (Original) The method of Claim 2, wherein the analog source is a videotape deck.



4. (Original) The method of Claim 2, wherein the analog source is a live satellite feed.

5. (Original) The method of Claim 1, wherein the video information is received
from a digital source. 

6. (Currently amended) The method of Claim 1, wherein the capturing includes
digitizing with an audio digitization [[device]] devices.

7. (Original) The method of Claim 1, wherein the plurality of audio metadata
tracks includes at least one of: keywords, speech-to-text transcription, speaker identification and 
audio class. 

8. (Original) The method of Claim 1, wherein the time codes comprise SMPTE codes.

9. (Original) The method of Claim 1, wherein the encoding comprises encoding
with an MPEG format. 

10. (Original) An audio engine for extracting metadata tracks, comprising:
an audio signal switch receiving an audio signal; 

an audio classification component controlling the audio signal switch according to 
whether the audio signal is classified as speech; and 

a plurality of audio metadata track extraction components in data communication with the 
output of the switch, wherein each audio metadata track extraction component provides an audio 
metadata track associated with speech. 

11. (Original) The audio engine of Claim 10, additionally comprising:

an audio capture component for capturing and digitizing an analog audio source; and 
an audio signal normalization component for normalizing the digitized audio prior to 
processing. 

12. (Original) The audio engine of Claim 10, wherein the audio metadata tracks
include at least one of: keywords, speech-to-text transcription and speaker identification.

13. (Original) The audio engine of Claim 10, wherein the audio classification
component additionally classifies at least silence and music. 

14. (Original) The audio engine of Claim 10, wherein the audio metadata track
extraction components receive data from a customizable dictionary. 

15. (Original) The audio engine of Claim 10, wherein the audio signal is received
from a real-time source. 



16. (Original) The audio engine of Claim 10, wherein the audio signal is received
from a digital source. 

17. (Original) The audio engine of Claim 10, wherein the audio signal is received
from a digital camcorder. 

18. (Original) An audio engine for extracting metadata tracks, comprising:
an audio signal switch receiving an audio signal;

an audio classification component in data communication with and controlling the
audio signal switch according to whether the audio signal is classified as speech; and 

a plurality of audio metadata track extraction components in data communication 
with the output of the switch, wherein each audio metadata track extraction component 
provides an audio metadata track associated with speech. 

19. (Original) The audio engine of Claim 18, wherein the audio metadata tracks 
include at least speaker identification. 

20. (Original) The audio engine of Claim 18, wherein the audio classification 
component additionally classifies at least music. 

21. (Original) The audio engine of Claim 18, wherein the audio metadata track 
extraction components receive data from a customizable dictionary of data associated with the 
extracted metadata tracks. 

22. (Original) The audio engine of Claim 18, wherein the audio signal is received
from a remote real-time source. 

23. (Original) The audio engine of Claim 18, wherein the audio signal is received 
from a remote digital source. 

24. (Original) A method of extracting audio for indexing of video, comprising:
receiving video information having embedded audio information and associated
time codes;

capturing the embedded audio information in the video information; 

extracting a plurality of audio metadata tracks from the audio information, each 
audio metadata track being associated with selected ones of the time codes indicative at 
least of start and stop times for the audio metadata track; 

encoding the video information; and 

accessing the encoded video information with the selected time codes of one of 
the audio metadata tracks. 

25. (Original) The method of Claim 24, wherein the video information is received 
from a remote digital source. 

26. (Original) The method of Claim 24, wherein the plurality of audio metadata 
tracks includes at least one of: keywords, speech-to-text transcription, speaker identification and 
audio class. 

27. (Original) An audio engine for extracting metadata tracks, comprising: 
an audio signal switch receiving an audio signal; 

an audio classification engine; 

an audio class dictionary configured to provide dictionary data indicative of audio 
classes to the audio classification engine; 

an audio class profiler in data communication with the audio classification engine,
wherein the audio class profiler receives the audio signal, and wherein the audio class 
profiler is further in data communication with and controls the audio signal switch 
according to whether the audio signal is classified as speech; and 

a plurality of audio metadata track extraction components in data communication 
with the output of the switch, wherein each audio metadata track extraction component
provides an audio metadata track associated with speech. 

28. (Original) The audio engine of Claim 27, additionally comprising: 

an audio capture component for capturing and digitizing an analog audio source; and

an audio signal normalization component for normalizing the digitized audio prior 
to processing. 

29. (Original) The audio engine of Claim 27, wherein the audio metadata tracks 
include at least one of: keywords, speech-to-text transcription and speaker identification. 

30. (Original) The audio engine of Claim 27, wherein the audio class profiler 
additionally classifies at least silence and music. 

31. (Original) The audio engine of Claim 27, wherein the audio metadata track 
extraction components receive data from a customizable dictionary of data associated with the
extracted metadata tracks. 

32. (Original) The audio engine of Claim 27, wherein the audio signal is received 
from a real-time source.

33. (Original) The audio engine of Claim 27, wherein the audio signal is received 
from a digital source.

34. (Original) The audio engine of Claim 27, wherein the audio signal is received
from a digital camcorder. 

35. (Original) The audio engine of Claim 27, wherein the audio class dictionary is 
customizable. 

36. (New) The method of Claim 1, wherein the audio metadata tracks 
comprise different types of audio metadata tracks. 

37. (New) The audio engine of Claim 10, wherein each audio metadata track 
extraction component provides a different type of audio metadata track. 

38. (New) The audio engine of Claim 18, wherein each audio metadata track 
extraction component provides a different type of audio metadata track. 

39. (New) The method of Claim 24, wherein the audio metadata tracks
comprise different types of audio metadata tracks. 

40. (New) The audio engine of Claim 27, wherein each audio metadata track
extraction component provides a different type of audio metadata track. 



